Components in Dynamic Video Content
نویسندگان
چکیده
The fast expansion of Internet and DVB channels has brought a fast increase of video footage which needs to be indexed for efficient and easy retrieval. This task has been historically done by documentalists who tag manually each video with a few keywords, unfortunately such work is time consuming and hence very expensive. In the last decade much effort has been put into building processes which automatically assign content-based labels to video documents, a proof of this is the existence of the TRECVid Video Retrieval Evaluation [1] workshops since 2003. Structural similarity metrics for still images has been largely studied lately, the goal of these is discovering underlying structure of an image that is impervious to rotations, translations, resizing and other transformation. This way images can be easily compared without being affected by their different scales and any kind of intermediate processing that they have experienced. The question tackled in this work is whether something analogous can be made for video, so similar videos can be detected independently of their size, frame rate and image content. Video indexing for retrieval is an old concept, first approaches date from the first half of the nineties. Back in 1994, Smoliar et al. [2] already stated the necessity for video software to identify and represent video content for indexing and retrieval. In parallel to video indexing, video classification has also brought much attention, while retrieval focus on finding videos in a database that match a given query, classification puts all the input videos into predefined categories, which are labeled. There are many ways to address these issues, mainly there have been three fields of research. One is text-based approach, which is based on identifying text objects and processing them with optical character recognition or extracting text from closed-captions, like did Wei Qi et al. [3] to automatically categorize news stories. Another approach is using audio features, processing the data can be made in time domain (energy, zero crossings): E. Wold et al. [4], Z. Liu et al. [5]; or in the frequency domain (bandwidth, frequency centroid, pitch): U. Srinivasan et al. [6]; a strong point in this case is the maturity of audio processing techniques. Finally, the third field of research is using visual information, which attracts a great deal of interest as most of the information processed by humans comes from their vision and because despite the efforts, the Human Visual Systems …
منابع مشابه
An Efficient Hierarchical Modulation based Orthogonal Frequency Division Multiplexing Transmission Scheme for Digital Video Broadcasting
Due to the increase of users the efficient usage of spectrum plays an important role in digital terrestrial television networks. In digital video broadcasting, local and global content are transmitted by single frequency network and multifrequency network respectively. Multifrequency network support transmission of global content and it consumes large spectrum. Similarly local content are well ...
متن کاملContent-based VBR Video Tra c Modeling and its Application to Dynamic Network Resource Allocation
In bandwidth limited networks and network interfaces, dynamic resource allocation can substantially increase the link utilization and also decrease the required network bu ering. In general, there are two important tradeo s e ecting the network utilization. First, tradeo between e ciency of the real-time dynamic resource allocation and the policy controlling the renegotiation frequency. Second,...
متن کاملA Specification Language for Dynamic Virtual Video Sequence Generation
The FRAMES project is developing a system for video database search, content-based retrieval, and virtual video program synthesis. For dynamic synthesis applications, a video program is specified at a high level using a virtual video prescription. The prescription is a document expressed in a semi-formal language specifying the video structure, including formal specifications for direct referen...
متن کاملAutomatic Annotation of Formula 1 Races for Content-Based Video Retrieval
Content-based video retrieval is emerging as an important part in the process of utilization of various multimedia documents. In this report we present a novel system for the automatic indexing and content-based retrieval of multimedia documents. We chose the domain of Formula 1 sport videos because the manual annotation of Formula 1 races is complicated and time consuming. Our system uses mult...
متن کاملCompressed Domain Scene Change Detection Based on Transform Units Distribution in High Efficiency Video Coding Standard
Scene change detection plays an important role in a number of video applications, including video indexing, searching, browsing, semantic features extraction, and, in general, pre-processing and post-processing operations. Several scene change detection methods have been proposed in different coding standards. Most of them use fixed thresholds for the similarity metrics to determine if there wa...
متن کاملGeneric Viewer Interaction Semantics for Dynamic Virtual Video Synthesis
The FRAMES project is developing a system for video database search, content-based retrieval, and virtual video program synthesis. For dynamic synthesis applications, a video program is specified at a high level using a virtual video prescription. The prescription is a document specifying the video structure, including specifications for generating associative chains of video components. Associ...
متن کامل